Fault injection testing of software implemented fault tolerance mechanisms of distributed systems
نویسنده
چکیده
One way of gaining confidence in the adequacy of fault tolerance mechanisms of a system is to test the system by injecting faults and see how the system performs under faulty conditions. This thesis investigates the issues of testing software-implemented fault tolerance mechanisms of distributed systems through fault injection. A fault injection method has been developed. The method requires that the target software system be structured as a collection of objects interacting via messages. This enables easy insertion of fault injection objects into the target system to emulate incorrect behaviour of faulty processors by manipulating messages. This approach allows one to inject specific classes of faults while not requiring any significant changes to the target system. The method differs from the previous work in that it exploits an object oriented approach of software implementation to support the injection of specific classes of faults at the system level. The proposed fault injection method has been applied to test software-implemented reliable node systems: a TMR (triple modular redundant) node and a fail-silent node. The nodes have integrated fault tolerance mechanisms and are expected to exhibit certain behaviour in the presence of a failure. The thesis describes how various such mechanisms (for example, clock synchronisation protocol, and atomic broadcast protocol) were tested. The testing revealed flaws in implementation that had not been discovered before, thereby demonstrating the usefulness of the method. Application of the approach to other distributed systems is also described in the thesis.
منابع مشابه
Focused fault injection testing of software implemented fault tolerance mechanisms of Voltan TMR nodes
One way of gaining confidence in the adequacy of fault tolerance mechanisms of a system is to test the system bv iniectina faults and see how the system performs under faulty conditions: This paper presknts an application of the focused fault inject'on method that has been developed lor testing software implemented fault tolerance mecnanisms of distributed systems. The method exploits tne objec...
متن کاملFault injection testing method of software implemented fault tolerance mechanisms of web service systems
متن کامل
Comparison of Physical and Software-Implemented Fault Injection Techniques
This paper addresses the issue of characterizing the respective impact of fault injection techniques. Three physical techniques and one software-implemented technique that have been used to assess the fault tolerance features of the MARS faulttolerant distributed real-time system are compared and analyzed. After a short summary of the fault tolerance features of the MARS architecture and especi...
متن کاملVerification of Fault Tolerance by Means of Fault Injection into VHDL Simulation Models*
This paper describes and illustrates the application of a strategy for testing fault tolerance by means of fault injection. The proposed approach encompasses a method for modelling fault tolerance together with a strategy for testing several classes of fault tolerance mechanisms. The modelling approach distinguishes two main abstraction levels: (i) a data flow diagram of the general organisatio...
متن کاملUsing Virtualization to Validate Fa Ult-tolerant Distributed Systems
Asynchronous events and complex system state distributed across independent nodes make exposure and diagnosis of flaws in distributed systems a challenge. The difficulties are exacerbated when the goal is to validate fault tolerance mechanisms that are activated only by the occurrence of errors, which are, by nature, rare. Validation of fault tolerance mechanisms is often done by injecting faul...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996